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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Lai, Preeti 

Bandman, Olga 

(ii) TITLE OF THE INVENTION: NOVEL HUMAN SODIUM-DEPENDENT 
PHOSPHATE COTRANS PORTER 

( i i i ) NUMBER OF SEQUENCES : 7 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Drive 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: US 

(F) ZIP: 94304 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 
(C> OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 2.0 

<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Filed Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE /DOCKET NUMBER: PF-0221 US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-855-0555 

(B) TELEFAX: 415-845-4166 



{ 2 ) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: BRAITUT02 

(B) CLONE: 754412 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Gin Val Asp Glu Thr Leu lie Pro Arg Lys Val Pro Ser Leu Cys 

15 10 15 

Ser Ala Arg Tyr Gly lie Ala Leu Val Leu His Phe Cys Asn Phe Thr 

20 25 30 

Thr lie Ala Gin Asn Val He Met Asn He Thr Met Val Ala Met Val 

35 40 45 

Asn Ser Thr Ser Pro Gin Ser Gin Leu Asn Asp Ser Ser Glu Val Leu 

50 55 60 

Pro Val Asp Ser Phe Gly Gly Leu Ser Lys Ala Pro Lys Ser Leu Pro 
65 70 75 80 

Ala Lys Ser Ser He Leu Gly Gly Gin Phe Ala He Trp Glu Arg Trp 

85 90 95 

Gly Pro Pro Gin Glu Arg Ser Arg Leu Cys Ser He Ala Leu Ser Gly 

100 105 110 

Met Leu Leu Gly Cys Phe Thr Ala He Leu He Gly Gly Phe He Ser 

115 120 125 

Glu Thr Leu Gly Trp Pro Phe Val Phe Tyr He Phe Gly Gly Val Gly 

130 135 140 

Cys Val Cys Cys Leu Leu Trp Phe Val Val He Tyr Asp Asp Pro Val 
145 150 155 160 

Ser Tyr Pro Trp He Ser Thr Ser Glu Lys Glu Tyr He He Ser Ser 

165 170 175 

Leu Lys Gin Gin Val Gly Ser Ser Lys Gin Pro Leu Pro He Lys Ala 

180 185 190 

Met Leu Arg Ser Leu Pro He Trp Ser He Cys Leu Gly Cys Phe Ser 

195 200 205 

His Gin Trp Leu Val Ser Thr Met Val Val Tyr He Pro Thr Tyr He 

210 215 220 

Ser Ser Val Tyr His Val Asn He Arg Asp Asn Gly Leu Leu Ser Ala 
225 230 235 240 

Leu Pro Phe He Val Ala Trp Val He Gly Met Val Gly Gly Tyr Leu 

245 250 255 

Ala Asp Phe Leu Leu Thr Lys Lys Phe Arg Leu He Thr Val Arg Lys 

260 265 270 

He Ala Thr He Leu Gly Ser Leu Pro Ser Ser Ala Leu He Val Ser 

275 280 285 

Leu Pro Tyr Leu Asn Ser Gly Tyr He Thr Ala Thr Ala Leu Leu Thr 

290 295 300 

Leu Ser Cys Gly Leu Ser Thr Leu Cys Gin Ser Gly He Tyr He Asn 
305 310 315 320 

Val Leu Asp He Ala Pro Arg Tyr Ser Ser Phe Leu Met Gly Ala Ser 

325 330 335 

Arg Gly Phe Ser Ser He Ala Pro Val He Val Pro Thr Val Ser Gly 

340 345 350 

Phe Leu Leu Ser Gin Asp Pro Glu Phe Gly Trp Arg Asn Val Phe Phe 

355 360 365 

Leu Leu Phe Ala Val Asn Leu Leu Gly Leu Leu Phe Tyr Leu He Phe 

370 375 380 

Gly Glu Ala Asp Val Gin Glu Trp Ala Lys Glu Arg Lys Leu Thr Arg 
385 390 395 400 

Leu 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1643 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



{vii) IMMEDIATE SOURCE: 

(A) LIBRARY: BRAITUT02 

(B) CLONE: 754412 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



AGAACGGTGA GGATGACCGA CGTATAGGCG AGAGCCTAGG TACGCCATGC CAGGTCACCG 60 

GTCCGGCAAT TCCCGGGTCG ACCCACGCGT CCGCTTGGAG GGACGCTGGG TTCAACTTGA 120 

AGCCCTTCCA CAGACATTAA GTCGGTGAAA ACCATTCACT AGGAGAGGAG AAACACAATG 180 

GCCACCAAGA CAGAGTTGAG TCCCACAGCA AGGGAGAGCA AGAACGCACA AGATATGCAA 240 

GTGGATGAGA CACTGATCCC CAGGAAAGTT C C AAGTTT AT GTTCTGCTCG CTATGGAATA 300 

GCCCTCGTCT TACATTTCTG CAATTTCACA ACGATAGCAC AAAATGTCAT CATGAACATC 360 

ACCATGGTAG CCATGGTCAA CAGCACAAGC CCTCAATCCC AGCTCAATGA TTCCTCTGAG 42 0 

GTGCTGCCTG TTG AC TC ATT TGGTGGCCTA AGTAAAGCCC CAAAGAGTCT TCCTGCAAAG 48 0 

TCCTCAATAC TTGGGGGTCA GTTTGCAATT TGGGAAAGGT GGGGCCCTCC ACAAGAACGA 540 

AGCAGACTCT GCAGCATTGC TTTATCAGGA ATGTTACTGG GATGCTTTAC TGCCATCCTC 600 

ATAGGTGGCT TCATTAGTGA AACCCTTGGG TGGCCCTTTG TCTTCTATAT CTTTGGAGGT 660 

GTTGGCTGTG TCTGCTGCCT TCTCTGGTTT GTTGTGATTT ATGATG AC C C CGTTTCCTAT 72 0 

CCATGGATAA GCACCTCAGA AAAAGAATAC ATCATATCCT CCTTGAAACA ACAGGTCGGG 780 

TCTTCTAAGC AGCCTCTTCC CATCAAAGCT ATGCTCAGAT CTCTACCCAT TTGGTCCATA 840 

TGTTTAGGCT GTTTCAGCCA TCAATGGTTA GTTAGCACAA TGGTTGTATA CATACCAACT 900 

TACATCAGCT CTGTGTACCA TGTTAACATC AGAGACAATG GACTTCTATC TGCCCTTCCT 960 

TTTATTGTTG CCTGGGTCAT AGGCATGGTG GGAGGC TATC TGGCAGATTT CCTTCTAACC 1020 

AAAAAGTTTA GACTCATCAC TGTGAGGAAA ATTGCCACAA TTTTAGGAAG TCTCCCCTCT 1080 

TCAGCACTCA TTGTGTCTCT GCCTTACCTC AATTCCGGCT ATATCACAGC AACTGCCTTG 114 0 

CTGACGCTCT CTTGCGGATT AAGCACATTG TGTCAGTCAG GGATTTATAT CAATGTCTTA 1200 

GATATTGCTC CAAGGTATTC CAGTTTTCTC ATGGGAGCAT CAAGAGGATT TTCGAGCATA 1260 

GCACCTGTCA TTGTACCCAC TGTCAGCGGA TTTCTTCTTA GTCAGGACCC TGAGTTTGGG 1320 

TGGAGGAATG TCTTCTTCTT GCTGTTTGCC GTTAACCTGT TAGGACTACT CTTCTACCTC 13 80 

ATATTTGGAG AAGCAGATGT C C AAGAATGG GCTAAAGAGA GAAAACTCAC TCGTTTATGA 1440 

AGTTATC CCA CCTTGGATGG AAAAGTCATT AGGCACCGTA TTGCATAAAA TAGAAGGCTT 1500 

CCGTGATGAA AATAC C AGTG AAAAGATTTT TTTTTCCTGT GGCTCTTTTC AATTATGAGA 1560 

TCAGTTCATT ATTTTATTCA GACTTTTTTT TGAGAGAAAT GTAAGATGAA TAAAAATTCA 1620 

AATAAAATGA TAACTAAGAA TGC 1643 

(2) INFORMATION FOR SEQ ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 467 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 450532 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Gin Met Asp Asn Arg Leu Pro Pro Lys Lys Val Pro Gly Phe Cys 

1 5 10 15 

Ser Phe Arg Tyr Gly Leu Ser Phe Leu Val His Cys Cys Asn Val lie 

20 25 30 

He Thr Ala Gin Arg Ala Cys Leu Asn Leu Thr Met Val Val Met Val 
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Asn Ser Thr Asp Pro His Gly Leu Pro Asn Thr Ser Thr Lys Lys Leu 

50 55 60 

Leu Asp Asn He Lys Asn Pro Met Tyr Asn Trp Ser Pro Asp He Gin 
65 70 75 80 

Gly He He Leu Ser Ser Thr Ser Tyr Gly Val He He He Gin Val 

85 90 95 

Pro Val Gly Tyr Phe Ser Gly He Tyr Ser Thr Lys Lys Met He Gly 

100 105 HO 

Phe Ala Leu Cys Leu Ser Ser Val Leu Ser Leu Leu He Pro Pro Ala 

115 120 125 

Ala Gly He Gly Val Ala Trp Val Val Val Cys Arg Ala Val Gin Gly 

130 135 140 

Ala Ala Gin Gly He Val Ala Thr Ala Gin Phe Glu He Tyr Val Lys 
145 150 155 160 

Trp Ala Pro Pro Leu Glu Arg Gly Arg Leu Thr Ser Met Ser Thr Ser 

165 170 175 

Gly Phe Leu Leu Gly Pro Phe He Val Leu Leu Val Thr Gly Val He 

180 185 190 

Cys Glu Ser Leu Gly Trp Pro Met Val Phe Tyr He Phe Gly Ala Cys 

195 200 205 

Gly Cys Ala Val Cys Leu Leu Trp Phe Val Leu Phe Tyr Asp Asp Pro 

210 215 220 

Lys Asp His Pro Cys He Ser He Ser Glu Lys Glu Tyr He Thr Ser 
225 230 235 240 

Ser Leu Val Gin Gin Val Ser Ser Ser Arg Gin Ser Leu Pro He Lys 

245 250 255 

Ala He Leu Lys Ser Leu Pro Val Trp Ala He Ser He Gly Ser Phe 

260 265 270 

Thr Phe Phe Trp Ser His Asn He Met Thr Leu Tyr Thr Pro Met Phe 

275 280 285 

He Asn Ser Met Leu His Val Asn He Lys Glu Asn Gly Phe Leu Ser 

290 295 300 

Ser Leu Pro Tyr Leu Phe Ala Trp He Cys Gly Asn Leu Ala Gly Gin 
305 310 315 320 

Leu Ser Asp Phe Phe Leu Thr Arg Asn He Leu Ser Val He Ala Val 

325 330 335 

Arg Lys Leu Phe Thr Ala Ala Gly Phe Leu Leu Pro Ala He Phe Gly 

340 345 350 

Val Cys Leu Pro Tyr Leu Ser Ser Thr Phe Tyr Ser He Val He Phe 

355 360 365 

Leu He Leu Ala Gly Ala Thr Gly Ser Phe Cys Leu Gly Gly Val Phe 

370 375 380 

He Asn Gly Leu Asp He Ala Pro Arg Tyr Phe Gly Phe He Lys Ala 
385 390 395 400 

Cys Ser Thr Leu Thr Gly Met He Gly Gly Leu He Ala Ser Thr Leu 

405 410 415 

Thr Gly Leu He Leu Lys Gin Asp Pro Glu Ser Ala Trp Phe Lys Thr 

420 425 430 

Phe He Leu Met Ala Ala He Asn Val Thr Gly Leu He Phe Tyr Leu 

435 440 445 

He Val Ala Thr Ala Glu He Gin Asp Trp Ala Lys Glu Lys Gin His 

450 455 460 

Thr Arg Leu 
465 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 560 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 507415 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Glu Phe Arg Gin Glu Glu Phe Arg Lys Leu Ala Gly Arg Ala Leu 

1 5 10 15 

Gly Arg Leu His Arg Leu Leu Glu Lys Arg Gin Glu Gly Ala Glu Thr 

20 25 30 

Leu Glu Leu Ser Ala Asp Gly Arg Pro Val Thr Thr His Thr Arg Asp 

35 40 45 

Pro Pro Val Val Asp Cys Thr Cys Phe Gly Leu Pro Arg Arg Tyr lie 

50 55 60 

lie Ala lie Met Ser Gly Leu Gly Phe Cys He Ser Phe Gly He Arg 
65 70 75 80 

Cys Asn Leu Gly Val Ala He Val Ser Met Val Asn Asn Ser Thr Thr 

85 90 95 

His Arg Gly Gly His Val Val Val Gin Lys Ala Gin Phe Asn Trp Asp 

100 105 HO 

Pro Glu Thr Val Gly Leu He His Gly Ser Phe Phe Trp Gly Tyr He 

115 120 125 

Val Thr Gin He Pro Gly Gly Phe He Cys Gin Lys Phe Ala Ala Asn 

130 135 140 

Arg Val Phe Gly Phe Ala He Val Ala Thr Ser Thr Leu Asn Met Leu 
145 150 155 160 

He Pro Ser Ala Ala Arg Val His Tyr Gly Cys Val He Phe Val Arg 

1S5 170 175 

He Leu Gin Gly Leu Val Glu Gly Val Thr Tyr Pro Ala Cys His Gly 

180 185 190 

He Trp Ser Lys Trp Ala Pro Pro Leu Glu Arg Ser Arg Leu Ala Thr 

195 200 205 

Thr Ala Phe Cys Gly Ser Tyr Ala Gly Ala Val Val Ala Met Pro Leu 

210 215 220 

Ala Gly Val Leu Val Gin Tyr Ser Gly Trp Ser Ser Val Phe Tyr Val 
225 230 235 240 

Tyr Gly Ser Phe Gly He Phe Trp Tyr Leu Phe Trp Leu Leu Val Ser 

245 250 255 

Tyr Glu Ser Pro Ala Leu His Pro Ser He Ser Glu Glu Glu Arg Lys 

260 265 270 

Tyr He Glu Asp Ala He Gly Glu Ser Ala Lys Leu Met Asn Pro Val 

275 280 285 

Thr Lys Phe Asn Thr Pro Trp Arg Arg Phe Phe Thr Ser Met Pro Val 

290 295 300 

Tyr Ala He He Val Ala Asn Phe Cys Arg Ser Trp Thr Phe Tyr Leu 
305 310 315 320 

Leu Leu He Ser Gin Pro Ala Tyr Phe Glu Glu Val Phe Gly Phe Glu 

325 330 335 

He Ser Lys Val Gly Leu Val Ser Ala Leu Pro His Leu Val Met Thr 

340 345 350 

He He Val Pro He Gly Gly Gin He Ala Asp Phe Leu Arg Ser Arg 

355 360 365 

His He Met Ser Thr Thr Asn Val Arg Lys Leu Met Asn Cys Gly Gly 

370 375 380 

Phe Gly Met Glu Ala Thr Leu Leu Leu Val Val Gly Tyr Ser His Ser 
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385 

Lys Gly Val Ala 

Phe Ala lie Ser 
420 

Tyr Ala Ser lie 
435 

Gly Met Val Cys 
450 

Arg Glu Glu Trp 
465 

Gly Gly Val lie 

Trp Ala Glu Pro 
500 

His Asp Gin Leu 
515 

Glu Pro Pro Gly 
530 

His Ser Thr Val 
545 



390 

lie Ser Phe Leu 
405 

Gly Phe Asn Val 

Leu Met Gly lie 
440 

Pro lie lie Val 
455 

Gin Tyr Val Phe 
470 

Phe Tyr Gly Val 
485 

Glu Glu Met Ser 

Ala Gly Ser Asp 
520 

Ala Pro Pro Ala 
535 

Gin Pro Pro Arg 
550 



395 

Val Leu Ala Val 
410 

Asn His Leu Asp 
425 

Ser Asn Gly Val 

Gly Ala Met Thr 
460 

Leu He Ala Ser 
475 

Phe Ala Ser Gly 
490 

Glu Glu Lys Cys 
505 

Glu Ser Glu Met 

Pro Pro Pro Ser 
540 

Pro Pro Pro Pro 
555 



400 

Gly Phe Ser Gly 
415 

He Ala Pro Arg 
430 

Gly Thr Leu Ser 
445 

Lys His Lys Thr 

Leu Val His Tyr 
480 

Glu Lys Gin Pro 
495 

Gly Phe Val Gly 
510 

Glu Asp Glu Val 
525 

Tyr Gly Ala Thr 

Val Arg Asp Tyr 
560 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: BRAITUT02 

(B) CLONE: 754412 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



ATTTATATCA ATGTCTTAGA TATTGCTCCA AGGTATTCCA GTTTTCTCAT GGGAGCATCA 6 0 

AGAGGATTTT CGAGCATAGC AC C TGTC ATT GTACCCACTG TCAGTGGATT TCTTCTTAGT 120 

CAGGACCCTG AGTTTGGGTG GAGGAATGTC TTCTTCTTGC TGTTTGCCGT TAACCTGTTA 180 

GGACTACTCT TCTACCTCAT ATTTGGAGAA GCAGATGTCC AAGAATGGGC TAAAGAGAGA 240 

AAACTCACTC GTTTATGAAG TTATCCCACC TT 272 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: XLR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CTTGATGCTC C C ATGAG AAA ACTGG 25 



( 2 ) INFORMATION FOR SEQ ID NO : 7 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: XLF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

AGGATTTTCG AGCATAGCAC CTGTC 



